Poisson factor models with applications to non-normalized microRNA profiling

نویسندگان

  • Seonjoo Lee
  • Pauline E. Chugh
  • Haipeng Shen
  • R. Eberle
  • Dirk P. Dittmer
چکیده

MOTIVATION Next-generation (NextGen) sequencing is becoming increasingly popular as an alternative for transcriptional profiling, as is the case for micro RNAs (miRNA) profiling and classification. miRNAs are a new class of molecules that are regulated in response to differentiation, tumorigenesis or infection. Our primary motivating application is to identify different viral infections based on the induced change in the host miRNA profile. Statistical challenges are encountered because of special features of NextGen sequencing data: the data are read counts that are extremely skewed and non-negative; the total number of reads varies dramatically across samples that require appropriate normalization. Statistical tools developed for microarray expression data, such as principal component analysis, are sub-optimal for analyzing NextGen sequencing data. RESULTS We propose a family of Poisson factor models that explicitly takes into account the count nature of sequencing data and automatically incorporates sample normalization through the use of offsets. We develop an efficient algorithm for estimating the Poisson factor model, entitled Poisson Singular Value Decomposition with Offset (PSVDOS). The method is shown to outperform several other normalization and dimension reduction methods in a simulation study. Through analysis of an miRNA profiling experiment, we further illustrate that our model achieves insightful dimension reduction of the miRNA profiles of 18 samples: the extracted factors lead to more accurate and meaningful clustering of the cell lines. AVAILABILITY The PSVDOS software is available on request.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Systematic enrichment analysis of microRNA expression profiling studies in endometriosis

Objective(s): The purpose of this study was to conduct a meta-analysis on human microRNAs (miRNAs) expression data of endometriosis tissue profiles versus those of normal controls and to identify novel putative diagnostic markers. Materials andMethods: PubMed, Embase, Web of Science, Ovid Medline were used to search for endometriosis miRNA expression profiling studies of endometriosis. The miRN...

متن کامل

An Efficient Algorithm for General 3D-Seismic Body Waves (SSP and VSP Applications)

Abstract The ray series method may be generalized using a ray centered coordinate system for general 3D-heterogeneous media. This method is useful for Amplitude Versus Offset (AVO) seismic modeling, seismic analysis, interpretational purposes, and comparison with seismic field observations.For each central ray (constant ray parameter), the kinematic (the eikonal) and dynamic ray tracing system ...

متن کامل

Determining the correlated factors of breast cancer recurrence by Poisson Beta-Weibull non- mixture cure model

Introduction: Therapies for many of diseases, especially cancer, have been improved significantly in the recent years, resulting in an increase in the number of patients who do not experience mortality. Therefore, the application of cure models is more suitable for survival analysis in this population than the usual survival models are. The aim of this study was to estimate the recurrence-free ...

متن کامل

The Exponentiated Poisson-Lindley Distribution; Features and Applications in Reliability

Abstract. In this paper a new three-parameter lifetime distribution named “the Exponentialed Lindley-Poisson (E-LP) distribution” has been suggested that it has an  increasing, decreasing and invers bathtube hazard rate depending on the parameter values. The (E-LP) distribution has applications in economics, actuarial modeling, reliability modeling, lifetime and queuing problems and biological ...

متن کامل

MDMA Abuse in Relation to MicroRNA Variation in Human Brain Ventral Tegmental Area and Nucleus Accumbens

Aim 3,4-methylenedioxymethamphetamine (MDMA) is one of the most widespread illegal drugs, used particularly by young people in the 15-34 age group. MicroRNAs (miRNAs) are endogenously synthesized, non-coding and small RNAs that post-transcriptionally regulate their target genes’ expression by inhibiting protein translation or degradation. miRNAs are increasingly implicated in drug-related...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Bioinformatics

دوره 29 9  شماره 

صفحات  -

تاریخ انتشار 2013